How to Use Temporal-Driven Constrained Clustering to Detect Typical Evolutions

Authors

  • Marian-Andrei Rizoiu
  • Julien Velcin
  • Stéphane Lallich
Abstract

In this paper, we propose a new time-aware dissimilarity measure that takes the temporal dimension into account: observations that are close in the description space but distant in time are considered dissimilar. We also propose a method to enforce segmentation contiguity by introducing, in the objective function, a penalty term inspired by the Normal Distribution Function. We combine the two propositions into a novel time-driven constrained clustering algorithm, called TDCK-Means, which creates a partition of clusters that are coherent in both the multidimensional space and the temporal space. The algorithm uses soft semi-supervised constraints to encourage adjacent observations belonging to the same entity to be assigned to the same cluster. We apply our algorithm to a Political Studies dataset in order to detect typical evolution phases. We adapt the Shannon entropy to measure entity contiguity, and we show that our proposal consistently improves the temporal cohesion of clusters without any significant loss in multidimensional variance.
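The abstract does not give the formula of the time-aware dissimilarity, so the following is only a minimal sketch of one way such a measure could behave, not the authors' definition. The function name `time_aware_dissimilarity`, the normalisation constants `max_feat_dist` and `max_time_dist`, and the weights `w_feat` and `w_time` are all hypothetical. The sketch combines a normalised feature distance and a normalised time distance so that two observations are considered similar only when they are close in both.

```python
import math


def time_aware_dissimilarity(x_i, x_j, t_i, t_j,
                             max_feat_dist, max_time_dist,
                             w_feat=1.0, w_time=1.0):
    """Illustrative time-aware dissimilarity in [0, 1].

    Assumption (not taken from the paper): the two components are
    combined multiplicatively, so a large gap in either the
    description space or in time is enough to make the pair dissimilar.
    """
    # Normalised Euclidean distance in the description space, capped at 1.
    d_feat = min(math.dist(x_i, x_j) / max_feat_dist, 1.0)
    # Normalised distance between timestamps, capped at 1.
    d_time = min(abs(t_i - t_j) / max_time_dist, 1.0)
    # Similarity is high only if BOTH components are small.
    similarity = (1.0 - w_feat * d_feat ** 2) * (1.0 - w_time * d_time ** 2)
    return 1.0 - similarity


# Same description, same timestamp: not dissimilar at all.
same = time_aware_dissimilarity((1.0, 2.0), (1.0, 2.0), 2000, 2000, 10.0, 10.0)

# Same description, but ten years apart: maximally dissimilar.
far_in_time = time_aware_dissimilarity((1.0, 2.0), (1.0, 2.0), 2000, 2010, 10.0, 10.0)
```

With both weights set to 1, an observation pair that is identical in the description space but separated by the maximum time span gets the same dissimilarity as a pair at the maximum feature distance, which is the behaviour the abstract describes in words. Lowering the hypothetical `w_time` would reduce the influence of the temporal component.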

Similar resources

Spatio-temporal patterns of crab fisheries in the main bays of Guangdong Province, China

  Using a semi-balloon otter trawl, crab fisheries in the main bays of Guangdong Province, China, were carried out seasonally. A total of 70 species were found, all belonging to the South China Sea Faunal subregion in the tropical India-West-Pacific Faunal Region. The clustering and nMDS ordination analysis revealed the existence of three groups. Group 1 included Hailing Bay and four bays to ...

Repeated Record Ordering for Constrained Size Clustering

One of the main techniques used in data mining is data clustering, which has many applications in computer science, biology, and social sciences. Constrained clustering is a type of clustering in which side information provided by the user is incorporated into current clustering algorithms. One of the well researched constrained clustering algorithms is called microaggregation. In a microaggreg...

Multi-scale Community Detection in Temporal Networks Using Spectral Graph Wavelets

Abstract Spectral graph wavelets introduce a notion of scale in networks, and are thus used to obtain a local view of the network from each node. By carefully constructing a wavelet filter function for these wavelets, a multi-scale community detection method for monoplex networks has already been developed. This construction takes advantage of the partitioning properties of the network Laplacia...

A Data-driven Method for Crowd Simulation using a Holonification Model

In this paper, we present a data-driven method for crowd simulation with holonification model. With this extra module, the accuracy of simulation will increase and it generates more realistic behaviors of agents. First, we show how to use the concept of holon in crowd simulation and how effective it is. For this reason, we use simple rules for holonification. Using real-world data, we model the...

Journal:
  • International Journal on Artificial Intelligence Tools

Volume 23

Published 2014